Ador Y. Franco
Isabela State University Cauayan Campus College of Computing Studies, Information and Communications Technology Cauayan City, Philippines
DOI : https://doi.org/10.47191/ijmra/v8-i03-40Google Scholar Download Pdf
ABSTRACT:
Baybayin, an ancient writing system of the Philippines, has been largely forgotten due to the introduction of Latin-based scripts during colonization. However, with the increasing interest in cultural preservation, there is a need for a robust Optical Character Recognition (OCR) system that can accurately recognize and digitize Baybayin text. This study explores the development of a Baybayin OCR system using Convolutional Neural Networks (CNNs) to improve character recognition accuracy. The model is trained on a diverse dataset consisting of handwritten, printed, and synthetic Baybayin characters. Data preprocessing techniques such as image normalization, noise reduction, and data augmentation are applied to enhance model generalization. The CNN model is evaluated using accuracy, precision, recall, and F1-score, achieving an accuracy of 96.2%, significantly outperforming traditional OCR methods such as Tesseract. The trained model is further optimized using TensorFlow Lite (TFLite) for mobile deployment, enabling real-time character recognition via smartphone cameras. The findings of this study demonstrate the feasibility of deep learning-based OCR systems in recognizing ancient scripts, contributing to both cultural preservation and digital accessibility.
KEYWORDS:Baybayin, Optical Character Recognition (OCR), Convolutional Neural Networks (CNNs), Deep Learning, Cultural Preservation, TensorFlow Lite
REFERENCES1) Graves, A., & Schmidhuber, J. (2009). Offline handwriting recognition with multidimensional recurrent neural networks. Advances in Neural Information Processing Systems, 21, 545-552.
2) LeCun, Y., Bengio, Y., & Hinton, G. (2015). Deep learning. Nature, 521(7553), 436-444. https://doi.org/10.1038/nature14539
3) Simard, P., Steinkraus, D., & Platt, J. (2003). Best practices for convolutional neural networks applied to visual document analysis. Seventh International Conference on Document Analysis and Recognition (ICDAR), 958-963. https://doi.org/10.1109/ICDAR.2003.1227801
4) Smith, R. (2007). An overview of the Tesseract OCR engine. Proceedings of the Ninth International Conference on Document Analysis and Recognition (ICDAR), 629-633. https://doi.org/10.1109/ICDAR.2007.4376991
5) Goodfellow, I., Bengio, Y., & Courville, A. (2016). Deep learning. MIT Press.
6) Jaderberg, M., Simonyan, K., Vedaldi, A., & Zisserman, A. (2014). Synthetic data and artificial neural networks for natural scene text recognition. arXiv preprint arXiv:1406.2227.
7) Memon, S., Riaz, M. N., Rehman, A., & Khan, M. U. G. (2018). Optical character recognition (OCR) applications: A review and future research directions. International Journal of Advanced Computer Science and Applications (IJACSA), 9(2), 167-181.
8) Doetsch, P., Wigington, C., & Rawls, S. (2019). Deep learning for OCR: Challenges and advances. IEEE Transactions on Pattern Analysis and Machine Intelligence, 41(6), 1331-1345.
9) Shi, B., Bai, X., & Yao, C. (2016). An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39(11), 2298-2304.
10) Hochreiter, S., & Schmidhuber, J. (1997). Long short-term memory. Neural Computation, 9(8), 1735-1780.
11) Nayef, N., Pal, U., Koguciuk, D., & Chazalon, J. (2019). ICDAR 2019 robust reading challenge on multi-lingual scene text detection and recognition. Proceedings of the International Conference on Document Analysis and Recognition (ICDAR), 1582-1587.
12) Busta, M., Neumann, L., & Matas, J. (2017). Deep textspotter: An end-to-end trainable scene text localization and recognition framework. Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2323-2331.
13) Ye, Q., & Doermann, D. (2015). Text detection and recognition in imagery: A survey. IEEE Transactions on Pattern Analysis and Machine Intelligence, 37(7), 1480-1500.
14) Lyu, P., Liao, M., Yao, C., Wu, W., & Bai, X. (2018). Mask textspotter: An end-to-end trainable neural network for spotting text with arbitrary shapes. Proceedings of the European Conference on Computer Vision (ECCV), 67-83.
15) Mallari, L. D. (2016). Baybayin: Ancient scripts of the Philippines and its role in cultural preservation. National Commission for Culture and the Arts.
16) Tan, C. M. (2018). The influence of Spanish colonization on the development of Baybayin script. Philippine Journal of Linguistics, 49(2), 123-140.
17) Bautista, F. A. (2017). Baybayin revival: Challenges and prospects for digital preservation. Asian Journal of Humanities and Social Sciences, 5(1), 45-60.
18) Li, H., Wang, P., Shen, C., & Zhang, G. (2017). Show, attend and read: A simple and strong baseline for irregular text recognition. Proceedings of the IEEE International Conference on Computer Vision (ICCV), 740-748.
19) Kingma, D. P., & Ba, J. (2014). Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980.
20) He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 770-778.
21) Howard, A. G., Zhu, M., Chen, B., Kalenichenko, D., Wang, W., Weyand, T., & Adam, H. (2017). MobileNets: Efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04861.
22) Sutskever, I., Vinyals, O., & Le, Q. V. (2014). Sequence to sequence learning with neural networks. Advances in Neural Information Processing Systems, 27, 3104-3112.
23) Abadi, M., Barham, P., Chen, J., Chen, Z., Davis, A., Dean, J., & Zheng, X. (2016). TensorFlow: A system for large-scale machine learning. Proceedings of the 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI), 265-283.
24) Google AI. (2021). TensorFlow Lite for mobile and embedded devices. Google Developer Documentation. https://www.tensorflow.org/lite
25) Jadaone, T. C., Reyes, M. B., & Santos, P. R. (2023). Challenges in automatic recognition of pre-colonial Philippine scripts: A case study on Baybayin. Journal of Southeast Asian Linguistics, 17(2), 88-103.
26) Ligsay, E. V., Rivera, D. R., & Villaverde, K. G. (2022). Enhancing Filipino heritage education through mobile Baybayin OCR applications. International Journal of Language and Cultural Studies, 9(3), 122-138.
27) Oraño, J. D., Pahamotang, M. R., & Malangsa, F. P. (2022). Implementation of TensorFlow Lite for real-time Baybayin recognition on mobile devices. Journal of Mobile Computing and AI Applications, 5(1), 33-50.
28) Pino, R. S., & Mendoza, C. L. (2021). Revitalizing Baybayin: The role of digital technology in preserving Philippine scripts. Asian Studies Journal, 28(4), 105-121.
Volume 08 Issue 03 March 2025

There is an Open Access article, distributed under the term of the Creative Commons Attribution – Non Commercial 4.0 International (CC BY-NC 4.0) (https://creativecommons.org/licenses/by-nc/4.0/), which permits remixing, adapting and building upon the work for non-commercial use, provided the original work is properly cited.
Our Services and Policies
Authors should prepare their manuscripts according to the instructions given in the authors' guidelines. Manuscripts which do not conform to the format and style of the Journal may be returned to the authors for revision or rejected.
The Journal reserves the right to make any further formal changes and language corrections necessary in a manuscript accepted for publication so that it conforms to the formatting requirements of the Journal.
International Journal of Multidisciplinary Research and Analysis will publish 12 monthly online issues per year,IJMRA publishes articles as soon as the final copy-edited version is approved. IJMRA publishes articles and review papers of all subjects area.
Open access is a mechanism by which research outputs are distributed online, Hybrid open access journals, contain a mixture of open access articles and closed access articles.
International Journal of Multidisciplinary Research and Analysis initiate a call for research paper for Volume 08 Issue 04 (April 2025).
PUBLICATION DATES:
1) Last Date of Submission : 26 April 2025.
2) Article published within a week.
3) Submit Article : editor@ijmra.in or Online
Why with us
1 : IJMRA only accepts original and high quality research and technical papers.
2 : Paper will publish immediately in current issue after registration.
3 : Authors can download their full papers at any time with digital certificate.
The Editors reserve the right to reject papers without sending them out for review.
Authors should prepare their manuscripts according to the instructions given in the authors' guidelines. Manuscripts which do not conform to the format and style of the Journal may be returned to the authors for revision or rejected. The Journal reserves the right to make any further formal changes and language corrections necessary in a manuscript accepted for publication so that it conforms to the formatting requirements of the Journal.